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ABSTRACT 

The caustic technique measures the mass of galaxy clusters in both their virial and infall regions and, 
as a byproduct, yields the list of cluster galaxy members. Here we use 100 galaxy clusters with mass 
-^200 > 10^'^h~^ Mq extracted from a cosmological A^-body simulation of a ACDM universe to test 
the ability of the caustic technique to identify the cluster galaxy members. We identify the true three- 
dimensional members as the gravitationally bound galaxies. The caustic technique uses the caustic 
location in the redshift diagram to separate the cluster members from the interlopers. We apply the 
technique to mock catalogues containing 1000 galaxies in the field of view of 12h~^ Mpc on a side at the 
cluster location. On average, this sample size roughly corresponds to 180 real galaxy members within 
3r2oo, similar to recent redshift surveys of cluster regions. The caustic technique yields a completeness, 
the fraction of identified true members, fc = 0.95 ± 0.03 within 3r2oo- The contamination, the 
fraction of interlopers in the observed catalog of members, increases from fi = 0.020^0 0^5 at r2oo 



to fi 



0.08 



+0.11 
-0.05 



at 3r2oo- No other technique for the identification of the members of a galaxy 



cluster provides such large completeness and small contamination at these large radii. The caustic 
technique assumes spherical symmetry and the asphericity of the cluster is responsible for most of the 
spread of the completeness and the contamination. By applying the technique to an approximately 
spherical system obtained by stacking the individual clusters, the spreads decrease by at least a factor 
of two. We finally estimate the cluster mass within 3r2oo after removing the interlopers: for individual 
clusters, the mass estimated with the virial theorem is unbiased and within 30 per cent of the actual 
mass; this spread decreases to less than 10 per cent for the spherically symmetric stacked cluster. 

Keywords: gravitation - galaxies: clusters: general - methods: data analysis - techniques: miscel- 
laneous - cosmology: miscellaneous - cosmology: dark matter - cosmology: large-scale 



structure of Universe 



1. INTRODUCTION 



Galaxy clusters provide crucial information to our un- 
derstanding of the large-scale cosmic structure and to 
constrain cosmological models. They populate the high- 
mass tail of the mass function of virialiscd galaxy sys- 
tems, and their abundance and redshift distribution de- 
pend on the average density of the Universe and the 
normalization of the po wer spectrum of the initial den- 
sity pert urbations (e.g.. iVoitl [20051 iDiaferio et all 120081 : 
iBorganil [2001) . Clusters are a hostile environment to 
galaxies and are thus also a unique tool to investigate 
the connec tion between environment and galaxy prop- 
erties (e.g..lDommguez et al.ll2001l: iMartmez et al.ll2008l : 



iSkibba et al.ll2009t iHuertas-Companv et al.ll2009i r 

Separating the galaxies that do actually belong to the 
cluster from the interlopers, the galaxies that happen to 
lie in the field of view but are not dynamically linked to 
the cluster, is crucial to derive accur ate estimates of th e 
cluster properties, including its mass (jPerea et al.l[T990[ ). 
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or the color and star formation gradients of its galaxy 
population ([Diaferio et al.|[2"001aD . 

Interloper rejection techniques are numerous and their 
sophistication has progressively increased over the years, 
thanks to the increased quality and richness of the ob- 
servational data: in fact, over the last decade, the hand- 
ful of clusters with tens of measured redshifts within ~ 
1— 2/i~^Mpc of the cl uster center has increased by at least 
a fac t or of ten (e.g.. iRines et al.l[2003[ : iRines fc Diaferiol 
[2006tlGeller et al.ll20lH) . 

Early observations of galaxy clusters do not usually ex- 
tend into the outer regions of the system and early inter- 
loper rejection techniques identify galaxy members solely 
on the basis of their redshift separation from the clus- 
ter center. The gravitational potential well can however 
become substantially shallower at increasing radius and 
the combination of velocity and radial distance is now an 
essential ingredient for the identification of galaxy mem- 
bers in samples that extend to the cluster virial radius 
and beyond. 

The caustic technique ([Diaferio fc Geller[ 119971 : 
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IDiaferiol [l999l [2001 ISerra et all [Ml identifies the 
escape velocity profile of galaxy clusters from their 
center to radii as large as 3r2oo, where r2oo is the radius 
of the sphere whose average density is 200 times the 
critical density of the Universe. The technique was 
thus applied to estimate the gravitational potential 
well and the mass profiles of galaxy clusters to radii 
that extend t o th e cluster infall re gion (see reviews in 
IDiaferiol [20091 and ISerra et al.ll20lH ). Where the cluster 
is in the appropriate redshift range for weak lensing 
mass estimation and a comparison is thus possible, 
caustic and lensing masses show reasonable agreement 
(jPiaferio et alj |2005[) and the systematic errors of the 
two techniques can be assessed (jGeller et al.ll2012) ). 

Because the technique measures the escape velocity 
profile, a byproduct of the caustic procedure is the iden- 
tification of interlopers. Compared to other interloper 
rejection algorithms, the caustic technique has two ma- 
jor advantages: (1) it does not require the system to be 
in dynamical equilibrium, and (2) it does not rely on the 
derivation of the cluster mass profile to remove interlop- 
ers. These advantages enable the technique to identify 
interlopers both in the central and in the outer regions 
of clusters, where other techniques can not be applied. 
The caustic technique only assumes spherical symmetry, 
an assumption that is common to most methods. 

The caustic technique as an interloper rejection al- 
gorithm, or some simplified versions of it, was ap- 
plied to real clusters to investigate the dependence 
of ga la xy pr o pertie s on environment (e.g.. [ Rines etaP 
2000, [200l l200l [Mahaian fc Ravchaudhurvl l200£ 



Hernandez-Fernandez et al. I I2012t iHwang et all 12012! 



and to provide robust estimates of the cluster ve- 
locity disp ersion and mass_ Ce g.. iBenatov et al.l 120061 : 
iLemze et a l. 2009; Zhang e t al.ll201lL 120121 ). 

Thanks to the approximate self-similarity of self- 
gravitating systems, the technique c an also be applied to 
reject stellar interlopers in galaxies: iBrown et al.l ()2010[ ) 
used the caustic method results to estimate the velocity 
dispersion profile of the stars in the Milky Way halo, and 
ISerra et al.l (j2010f ) demonstrated that a proper stellar in- 
terloper rejection alleviates the tension between the in- 
ternal velocity dispersion profiles of the Milky Way dwarf 
satellites a nd the expectat i ons o f Modified Newtonian 
Dynamics. lYegorova et al.l (|201lD also probed the dark 
matter distribution in the outer regions of disk galaxies 
by identifying their satellites with the caustic technique. 

Despite this extensive application, the caustic tech- 
nique has never been exhaustively explored as a method 
to identify interlopers. Here, we provide a thorough anal- 
ysis of its performance, and of its random and systematic 
errors. In Section [2| we bricfiy describe the caustic tech- 
nique, and in Scction[3]we present the mock cluster cata- 
logs. In Section m we discuss the technique performance. 
We finally investigate the impact of our interloper re- 
jection on the cluster mass estimates in Section \5\ We 
compare the performance of our method with other re- 
jection techniques in Section [6| and conclude in Section 

m 

2. THE CAUSTIC TECHNIQUE 

In hierarchical clustering, clusters of galaxies form by 
the aggregation of smaller systems: t he accretion is not 
purely radial (e.g.. IWhite et al.ll2010[) . because galaxies 



within the falling clumps have velocities with a substan- 
tial non-radial component. Therefore, the galaxy veloc- 
ities are set by the local gravitational potential more 
than by the radial infall expected i n the spherical col- 
lapse model (jDiaferio fc Gellei][T997h . 

When observed in the redshift diagram, the plane of 
the line-of-sight velocity v of the galaxies in the cluster 
rest frame vs. their projected distance r from the clus- 
ter center, the cluster members populate a region with a 
trumpet shape approximately symmetric along the r axis 
(jKaiser 1987; Rcgos & Gellcr 1989). The caustics de- 
fine the boundaries of this regi on whose amplitud e A(r) 
decreases with increasing r. IDiaferio fc Gelled ([l997[ ) 
demonstrate that A{r) is a combination of the profile 
of the escape velocity from the cluster and the profile 
of the velocity anisotropy parameter P(r) ~ 1 — ((wg) + 
(i'^))/2(i'^), where vg, w^, and Vr are the longitudinal, 
azimuthal and radial components of the velocity v of a 
galaxy, respectively, and the brackets indicate an aver- 
age over the velocities of the galaxies in the volume d'^r 
centered on position r. 

In a spherically symmetric system, the average square 
of the velocity of the system members at radius r is 
(^v^) = {vf^^)g{l3) where (vf^^) is the component of the 
line-of-sight velocity and 



3 - 2/3(r) 



(1) 



For the escape velocity at radius r, we have (Wesc(^)} = 
—2(j){r), where (j){r) is the gravitational potential. If the 
amplitude A{r) measures the average component along 
the line of sight of the escape velocity at radius r, namely 



csc.los 



), we obtain the relation 



2(b{r)^A^{r)g{f3)=cj)p{r)g{(3) . 



(2) 



The gravitational potential profile is related to the 
caustic amplitude by the functio n q(l3). Ho wever, q{P) is 
a slowly chan g:ing function of r (jDiaferio fc Gellcr 1993 
IDiaferiol [T99liSerra et all [Ml . and it can be well ap- 
proximated by a constant. Thanks to the relation be- 
tween the gravitational potential and the mass, the mea- 
sure of A{r) can thus provide the estimate of both the 
escape velocity pr ofile from the cluster and the mass pro - 
file of the cluster (jDiaferio fc Gelleri[T997t IDiaferiol[T999l ) . 

To measure A{r) we need to locate the caustics in the 
redshift diagram. The technique consists of three major 
steps: (1) the construction of a binary tree based on the 
projected galaxy pairwise energy; (2) the determination 
of a threshold to cut the binary tree; (3) the identification 
of the cluster center to obtain the redshift diagram and 
determine the galaxy number density on this diagram. 

At the first step, all the galaxies are arranged in a bi- 
nary tree according to their pairwise binding energy Eij , 
estimated with the pair projected separation and line- 
of-sight velocity difference. The binary tree is built as 
follows: (i) initially each galaxy is a group ga] (ii) the 
binding energy Eajs = min{Eij}, where Eij is the bind- 
ing energy between the galaxy i € ga and the galaxy 
j & gp, is associated to each group pair ga-^gp] (iii) the 
two groups with the smallest binding energy Eap are 
replaced with a single group g^ and the total number of 
groups is decreased by one; (iv) the procedure is repeated 
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Figure 1. Dendrogram representation of the binary tree of a random subsample of 200 particles in the field of view of a simulated cluster. 
The particles arc the leaves of the tree at the bottom of the plot. The particles within 3r2oo in real space are highlighted in black. The 
thick path highlights the main branch of the tree. The horizontal lines show the levels at the two nodes xi (upper line) and X2 (lower line) 
that limit the cr plateau shown in Figure [3] The upper node xi is the threshold where the tree is cut and the main group is the structure 
hanging from this node. Only as a guide, some nodes are labelled on the left-hand side, with their number of associated particles, the 
descendants, in brackets. 



from step (ii) until only one group is left. Figure [T] shows 
the binary tree of a random sample of 200 particles ex- 
tracted from a simulated halo selected from the N-body 
simulation described in the next section, whereas Figure 
[2] shows the celestial coordinates of the same particles 
with the same color code as in Figure [TJ 

The second step of the caustic technique procedure is 
the threshold choice. The tree arranges the galaxies in 
potentially distinct groups; however, to get effectively 
distinct groups and to specifically define the set of candi- 
date members, we need to cut the tree at some level. This 
level sets the node from which the candidate members 
hang. All these candidate members do not necessarily 
coincide with the optimal members that are determined 
by the caustic location. Below we will extensively illus- 
trate the reason for this distinction between candidate 
and optimal members. 

In order to choose the threshold to cut the binary tree, 
we identify the main branch as the branch that emerges 
from the root and contains the nodes from which, at each 
level, the largest number of galaxies (or leaves) hang. 



The leaves hanging from each node x of the main branch 
provide a velocity dispersion uj^g. When walking along 
the main branch from the root to the leaves, crj^g rapidly 
decreases due to the progressive loss of galaxies that are 
most likely not associated with the cluster (Figure |3]) ; 
then crj^g reaches a "ct plateau" at some node xi. Most 
of the galaxies hanging from this node are members; in 
fact, the system is nearly isothermal and the removal of 
the less bound galaxies does not affect the value of crj^g. 
At some point of the walk along the main branch, the 
loss of the most bound galaxies, whose binding energy 
is very small, causes crj^g to drop again. This second 
rapid drop identifies the nodes X2 which sets the limit of 
the cr plateau. The first node xi closest to the root is 
the appropriate level for the identification of the system 
and we define the galaxies hanging from it the candidate 
members of the cluster. They determine the center of 
the system, its radius, and its line-of-sight velocity dis- 
persion. These quantities arc used to build the redshift 
diagram. 

The third step of the procedure is the location of the 
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Figure 2. Celestial coordinates of the subsamplc of 200 particles 
in the field of view of the simulated cluster whose binary tree is 
shown in Figure [T] The color code is the same as in Figure [T] 
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Figure 3. Velocity dispersion of the leaves of each node along 
the main branch of the binary tree shown in Figure [T] The square 
and the triangle show the nodes xi and X2 respectively. The curve 
between xi and X2 is the a plateau. The dashed line shows the 
line-of-sight velocity dispersion of the particles within the sphere 
of radius 3r200. 

caustics in the redshift diagram. The caustics are the 
curves satisfying the equation fq{r, v) = k. Here fq{r, v) 
is the galaxy number density in the redshift diagram, 
namely the plane (r, u), and k is the root of the equation 



(3) 



The function 
is the mean 



(Wcsc)k.h = Al{r)ip{r)dr/ ip{r)dr 
caustic amplitude within R, ipir) ~ 

J fqi'''-, v)dv, {v^y/^ is the velocity dispersion of the can- 
didate members, R is their mean projected separation 
from the center, and q is a smoothing parameter (see 
iDiaferio fc GeIIe3fT997l: iSerra et al.|[201ll for details). 

Figure |4] shows the result of this procedure on the red- 
shift diagram. 

3. SIMULATED CLUSTERS AND MOCK CATALOGS 
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Figure 4. Redshift diagram of the subsample of 200 particles 
in the field of view of the simulated cluster whose binary tree is 
shown in Figure [l] The black lines with error bars are the caus- 
tics located by the caustic technique. The cyan lines are the real 
caustics determined by the profiles of the escape velocity and the 
velocity anisotropy parameter derived by the three-dimensional in- 
formation. The dots show the particles in the catalog and the color 
code is as in Figure [T] 

We use t he sy nthetic galaxy clusters described in 
ISerra et al.l (1201 ID selected from the N-body simulation 
of iBorgani et air()2004j ). The simulation models a cu- 
bic volume of 192 h-^ Mpc on a side of a flat ACDM 
model, with matter density = 0.3, Hubble param- 
eter h = 0.7, normalization of the power spectrum 
(78 = 0.8, and baryon density fib = 0.02h~^. The sim- 
ulation contains 480'^ dark matter particles with mass 
muM = 4.6 X 10^h~^ M© and, initially, 480^^ gas par- 
ticles with mass TO<xas = 6.9 x 10 ^h~^ simu- 
lation was run with GADGET-2 (iSpringell [20051 ). Fur- 
ther details of the s imulations and the d ark matter halo 
identification are in IBorgani et all (|2004f ). In the follow- 
ing, we limit our analysis to the gravitational dynamics 
of the dark matt er distribution . In fa c t, both TV-body 
simulations (e.g.. IDiaferio et al ll20Qlbl: IGill et al.ll200l 
iDiemand et all 120041: iGill et al.ll2Q05D and observations 
(e.g.. iRines et al.l 120081 ) indicate that any velocity bias 
between galaxies and dark matter is negligible. 

We consider the 100 dark matter halos with mass M(< 
r2oo) = M200 > IO^^/i-^Mq at redshift z = 0. We locate 
each halo at {a,S) = (6'*,0°) and redshift cz = 32000 
km s^^. We simulate the compilation of the redshift cat- 
alog of a galaxy cluster by projecting each halo along ten 
random lines of sight. For each of these lines of sight, we 
choose two additional directions orthogonal to the first 
one and to each other. We end up with 3000 mock red- 
shift catalogs. Each catalog contains a random sample 
of 1000 particles distributed within a rectangular paral- 
lelepiped centered on the cluster with a squared field of 
view of 12h~^ Mpc on a side and 192 Mpc deep. 
With this number of particles in the field of view, we 
obtain a distribution of the number of particles within 
the sphere of radius r2oo that has median 101 and per- 
centile range [10%, 90%] equal to [51,226]; the median 
number of particles within 3r2oo is 185 and the percentile 
range [10%, 90%] is [96,408]. These numbers arc com- 
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parable to the sample sizes of recent large galaxy red- 
shift surveys of clusters and their sur roundings, like CI RS 
(|Rines fc Diaferiol 120061) and HeCS (iRines et alJl20T2l) . 

The binary tree algorithm applied to the individual 
mock catalogs gives a center of the cluster and a veloc- ^ 
ity dispersion of the candidate members. The center and ^ 
velocity dispersion determined with the bin ary tree are 
close to the correct quantities in most cases (jSerra et alj ^ 
1201 It) . Specifically, in 2678 mock catalogs (89 per cent ^ 
of the cases) the algorithm locates the center on the ex- oj 
pected cluster; in the remaining 11 per cent of the cases, oo 
the field of view is particularly crowded with numerous S 
groups and clusters, and the cluster of interest might 
not be the most massive cluster in the field. In these 2 
cases, the algorithm identifies the center of a different 
cluster. In a similar situation happening with catalogs 
of real clusters we will relocate the center on the cluster 
of interest. Here, we simply remove these problematic 
catalogs. Among the 2678 correctly identified clusters, 
the estimated velocity dispersion within 3r2oo is within 
5 (30) per cent of the real one in 50 (95) per cent of 
the systems; the center deviations are smaller than 0.07° 
on the sky and 250 km s~^ along the line of sight in 
90 per cent of the clusters. The largest discrepancies 
between the correct center and the center found by the 
algorithm occurs in systems with evident substructures 
that produce multiple peaks of the particle number den- 
sity distribution. When happening with catalogs of real 
clusters, these cases can yield off-centered rcdshift dia- 
grams. This problem can be removed by relocating the 
center on the most luminous galaxy of the cluster or on 
the peak of the X-ray emission. In our mock catalogs, 
we do not keep these systems, but further remove those 
catalogs where the center found by the algorithm has 
an offset greater than 0.5/i~^Mpc on the sky or greater 
than 400 km s~^ along the line of sight. Thus, the final 
number of mock catalogs reduces from 2678 to 2420. 

4. IDENTIFICATION OF CLUSTER MEMBERS 
4.1. Definition of members 

A galaxy is a cluster member if its binding energy is 
negative, namely if its velocity v is lower than the veloc- 
ity Vcsc (r) required to escape the cluster when the galaxy 
is at distance r from the cluster center: v"^ < v^^^{r). Fig- 
ure [5] shows the number density distribution of the dark 
matter particles in our mock cluster catalogs in the plane 
of binding energy vs. the three-dimensional clustrocen- 
tric distance. The plot includes the entire sample of 2420 
cluster catalogs. 

Figure [5] shows that a substantial fraction of bound 
parti cles have clustrocen tric distance much larger than 
r2oo (jWoitak et al.l [2b07f ). Therefore, in principle, we 
might use a different and simpler criterion to define a 
cluster member: a galaxy whose clustrocentric distance 
is smaller than, for example, 3'r2oo- This criterion is ac- 
tually more restrictive than the criterion based on the 
binding energy, as Figure [5] suggests. Nevertheless, we 
include this criterion in the following analysis, for the 
sake of comparison. Hereafter, we call 3D members these 
two sets of members defined on the basis of the three- 
dimensional information. 

We expect that the caustic technique identifies the 3D 
members as the galaxies within the caustics in the red- 
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Figure 5. Logarithm of the number density distribution of the 
particles in the plane of binding energy vs. clustrocentrie distance. 
The top bar shows the color scale, with the number density distri- 
bution normalized to the total number of particles in the diagram. 
Binding energies and clustrocentric distances are normalized to the 
three-dimensional velocity dispersion a and r2oo of each individual 
cluster respectively. The number density distribution includes all 
the particles from the entire sample of 2420 mock catalogs. 

shift diagram. In the following we will also consider a 
criterion based on the binary tree. In fact, as described 
in Sect. ^ the caustic technique arranges the galaxies 
in a binary tree according to their pairwise projected 
binding energy; by cutting the tree at the a plateau we 
define a set of candidate members. In the following anal- 
ysis, we show that the choice of this name is appropriate, 
because the interloper contamination of this set of can- 
didate members is larger than the contamination of the 
set of members determined by the caustic location. 

In conclusion, we consider two definitions of 3D mem- 
bers: (a) galaxies with negative binding energy; (b) 
galaxies within 3^200; and two possible criteria for their 
identification: (1) galaxies within the caustics in the red- 
shift diagram; (2) galaxies on the main branch of the bi- 
nary tree cut at the a plateau. Hereafter, we refer to 
the members identified with methods (1) or (2) as 2D 
members. 

In Sections l4.2l and [ 4 . 31 we focus on the performance of 
the first and second criterion respectively. We compute 
two relevant quantities: the completeness /c, which is 
the fraction of 3D members that are also identified as 2D 
members, and the contamination fi, which is the ratio 
between the number of particles taken as 2D members 
that are actually interlopers and the total number of 2D 
members. 

4.2. 2D members: Caustic location 

Figure [6] shows the redshift diagram of a cluster from 
our sample, with the bound galaxies defined as the 3D 
members, shown as blue dots. As expected, most of the 
3D members are within the caustics. 

To illustrate how the method performs on average, we 
compute the completeness and contamination profiles of 
each cluster. At each radius, we consider the median of 
the set of profiles and their dispersion. The upper left 
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Figure 6. The redshift diagram of 1000 particles of a syntiietic 
cluster. The black lines with l-cr error bars and the cyan lines 
are the estimated and true caustics respectively. The symbols are 
the particles in the catalog; the blue dots are the bound particles. 
There are 636 particles between the upper and the lower caustics. 

panel of Figure [7] shows the median differential profile 
of the completeness and the regions containing 50, 68 
and 90 per cent of the profiles. 

Only at small radii the caustic algorithm removes a few 
per cent of the 3D members, because the caustic ampli- 
tude is slightly underestimated, as can be seen in the 
example of Figure |6l The caustic criterion thus provides 
a completeness close to 0.95 at radii smaller than 0.2r2oo! 
and increases to 1.0 at larger radii. 

The median differential contamination is larger than 
0.1 at radii larger than 2r2oo (Figure [3 bottom left 
panel), but the cumulative contamination remains be- 
low 0.09 at 3r2oo (Figure [71 bottom right panel). Table 
[T] gives the corresponding 68 per cent levels for the cu- 
mulative profiles of fc and fi. 

Figure [5] reproduces the same redshift diagram of Fig- 
ure |6] with the red dots the particles within 3r2oo from 
the cluster center defined as the 3D members. In this 
case, most 3D members are within the caustics, but, 
at large radii, many particles within the caustics are 
not 3D members. In fact, in a sample of particles 
extracted from a spherical halo whose number density 
profile decreases with radius, the number of particles 
within a given 3D radius r observed in projection can 
fall to zero beyond a projected radius rg < r; obvi- 
ously, r-f, decreases with the size of the particle sample. 
For a iNavarro. Frenk. fc Whitd ()1997[ ) number density 
profile and for a sample of 500 particles within 3r2oo 
(the case that applies to the cluster in Figure [8|), we 
find that the number of particles with a 3D distance 
smaller than 3r2oo is zero at projected distances larger 
than Te = 2.4r2oo, on average; this result agrees with Fig- 
ureHlwhere r2oo = l.lSft-^ Mpc and re = 2.8/1"^ Mpc. 

As a consequence, the difi^erential contamination fi re- 
mains smaller than 0.1 within r2oo, on average, but in- 
creases dramatically at larger radii and reaches /i = 1 at 
r ^ 2.7r2oo- It follows that the corresponding cumulative 
profile of the fraction of galaxies identified as members 
when they are actually interlopers reaches 0.27 at 3r2oo 
(Table p. 

On the other hand, the profiles of the completeness 
we obtain with this definition of 3D members are very 



similar to Figure [T] the method yields a large 0.95) 
and stable completeness to radii as large as 3r2oo (Table 

4.3. 2D members: Main group of the binary tree 

We now evaluate the completeness fc and the contami- 
nation fi of the set of members identified with the binary 
tree. 

With the bound galaxies as the 3D members, the com- 
pleteness fc = 1.00 is constant over the entire range 
of r (Table [ij . The median differential contamination 
fi reaches a maximum value of ~ 0.4 at r > 2.5r2oo- 
The cumulative profile thus reaches the value 0.13 (Ta- 
ble [T]) at r — 37'200- The differential profile of fi shows 
that, if the bound galaxies are taken as 3D members, us- 
ing the binary tree procedure introduces interlopers at 
r > 0.8r2oo; this result is somewhat worse than the caus- 
tic location performance shown in the previous section, 
because the caustic location, on averge, provides sam- 
ples without contamination up to r = 1.5r2oo (Figure 
[7]). Overall, however, when we adopt the bound particles 
as 3D member, both the caustic location and the binary 
tree give high levels of completeness (fc ^ 0.95 — 1.0) 
and low levels of contamination {fi ^ 0.08 — 0.13) within 
3r2oo- 

In the case of the galaxies within 3r2oo as 3D members 
(definition (b) in Section |4?T|) . the completeness fc has a 
constant median value /c = 1.0 (Table[I]). Clearly, apply- 
ing the binary tree procedure to determine the members 
of a cluster guarantees an extremely high completeness 
of the sample. On the other hand, the median differen- 
tial contamination is smaller than 0.1 at r < r2oo, and 
increases at larger radii up to 1. This high contamination 
at large radii translates into a cumulative of 0.31 at 
Sr2oo (Table [1]). The reason for this large contamination 
at large radii derives from the decreasing of the number 
density profile, as discussed in Section 

4.4. Identification of members in stacked clusters 

As expected, the results listed in Table [1] indicate that 
the caustic location is more effective than the binary 
tree algorithm in identifying 3D members and that the 
binding energy criterion is more appropriate than the 
geometrical criterion to define members based on three- 
dimensional data. Table [T] also shows the spreads of the 
completeness and contamination. These spreads origi- 
nate from the random and systematic errors of the caus- 
tic technique, which are mostly due to the assumption of 
spherical symmetry (jSerra et all 120 lit) . In addition, we 
expect that the performance of the technique depends on 
the number of galaxies in the catalog. 

To quantify these effects, we stack our 3000 mock cat- 
alogs and randomly choose particles in the catalog until 
we obtain a given number TV of particles within 3r2oo 
from the cluster center in real space. The stacking was 
done by scaling the coordinates with r2oo and the three- 
dimensional velocity dispersion of each cluster. Here we 
show the results for two extreme cases with = 50 and 
A^ = 200. We compile 100 different catalogs for each 
value of A^ and we apply the procedure to determine the 
main group of the binary tree and to locate the caustics 
on the corresponding redshift diagrams. The complete- 
ness and contamination profiles are shown in the upper 
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Figure 7. Upper panels: Differential (left panel) and cumulative (right panel) median profiles (solid squares) of the completeness /c, 
where the 3D members are the bound galaxies and the 2D members are the galaxies within the eausties. Lower panels: Differential (left 
panel) and cumulative (right panel) profiles of the contamination The darkness of the shaded areas is proportional to the profile number 
density on the vertical axis. The dotted, dashed and dot-dashed lines limit the areas including the 50, 68, and 90 per cent of the profiles 
respectively. 
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1.000 
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1.000 


0.010 
0.034 
0.067 


0.031 
0.078 
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0.083 
0.156 
0.233 


0.990 
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0.980 


1.000 
1.000 
1.000 


1.000 
1.000 
1.000 


0.020 
0.073 
0.190 


0.050 
0.149 
0.306 


0.129 
0.274 
0.443 



Table 1 

Cumulative completeness /c and contamination fi at r200: 2r200i and 3r200i a^nd their l-tr dispersion, where the 3D members are the 
bound particles (left) or the particles within 3r2oo (right) and the 2D members are the particles within the caustics (top) or in the main 

group of the binary tree (bottom). 
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Figure 8. Same as Figure [G] with the red dots the 3D members 
defined as the particles within the sphere of radius 3r2oo ■ 



panels of Figure [9] for = 50 and in the lower panels for 
N = 200. We only show the case where the bound galax- 
ies are the 3D members, and the caustic location is used 
to select the 2D members. The profiles show that the 
effect of increasing TV from 50 to 200 is not significant on 
the median completeness and contamination profiles, but 
the associated spreads drop by at least a factor of 4 for 
the completeness and a factor of 2 for the contamination 
(Table 13). 

5. MASS ESTIMATION 

In this section, we analyze the effect of our interloper 
removal methods on the estimation of the mass, because 
interlopers have a non-negligible impact on the mass es- 
timation, especially at large radii, where interlopers can 
cause an overestimate o f the mass as large as a factor of 
three (|Perea et al.lll990( ). 

We consider the three standard methods described in 
iHeisler et all (|1985[ ): the virial, the average and the me- 
dian mass estimators. All estimators assume that the 
galaxies have equal mass and the system is in a steady 
state. The virial mass estimator is 



Ml 



VT 



3ttN V 



2 

los i 



2G J2i<j l/^-L,ii 



(4) 



where N is the number of galaxies with measured red- 
shifts, vios is the line-of-sight velocity and R±^ij is the 
projected separation between galaxy i and galaxy j. We 
do not add the surface pressure term in this analysis, be- 
cause the correction that it introduces is ex pected to be 
smaller than 10 per cent (jRines "eFaII[200l . 

The median mass estimator is supposed to be less 
sensitive to interlopers, because if interlopers in veloc- 
ity populate the tails of the distribution of the quantity 
(^^losi — viosj)'^R±,ij, which is estimated for each of the 
N{N — l)/2 pairs, the median of this quantity is a more 
robust estimate than the mean. The mass is thus 



M, 



Me 



G 



med{(uiosi - viosj)"^ R±,ij} , (5) 



where the co efficient /ms = 6.5 is calibrated with N-body 
simulations ([Heisler et al.|[T985l) . 
If we take the mean, rather than the median, we obtain 



May 



3/ 



AV 



GN{N - 1) 



Vlos j)'^Rl 



(6) 



i<j 



where /a v = 2.8 is again c alibrated with N-body sim- 
ulations (jHeisler et allll985|) . As mentioned above, one 



expects that the virial and average mass estimators are 
more sensitive to interlopers than the median mass esti- 
mator. 

We apply these three estimators to our samples after 
removing the interlopers with our two different proce- 
dures: (1) the caustic location and (2) the binary tree. 

The top panel of Figure [10] shows the results of ap- 
plying the mass estimators to individual clusters within 
r2oo, 2r2oo and 3r2oo- On average, the mass estimate is 
unbiased when the member galaxies are identified with 
the caustic location, whereas it is biased high by at least 
20 per cent when the member galaxies are extracted from 
the main group of the binary tree. This result confirms 
our expectation that the caustic location removes inter- 
lopers more efficiently than the binary tree procedure. 
As expected, the median mass estimator is the less sen- 
sitive method to the presence of interlopers: in fact, it 
yields the values of Afcst/Aftruo closest to one when the 
interlopers are removed with the less efficient binary tree 
procedure. 

For comparison, we also show the mass estimated with 
the caustic technique applied to the full sample of parti- 
cles, because, in principle, the technique is not affected 
by the presence of interlopers. With all the estimators, 
the spread increases with radius. For the caustic tech- 
nique, this increase derives from the smaller number of 
galaxies available for locating the caustics. For the other 
estimators the system is required to be in virial equilib- 
rium, that does not necessarily hold at radii larger than 
'"200; therefore, at these radii, the three standard estima- 
tors are more likely to return an incorrect mass. 

In addition, part of these spreads derives from the as- 
sumption of spherical symmetry. The bottom panels of 
Figure [10] show the mass estimates of the stacked clus- 
ters with iV = 50 and N = 200. The spreads for the 
case = 50 are slightly smaller than in the case of in- 
dividual clusters (upper panel of Figure [T0|). despite the 
fact that, on average, individual clusters have 185 galax- 
ies within 3r2oo, a factor of 3.7 larger than the iV = 50 
stacked cluster (bottom left panel of Figure [TO)) . In the 
stacked clusters, the assumption of spherical symmetry is 
basically correct, therefore the spread only derives from 
the sample size. In fact, the spreads further reduce by a 
factor of roughly 25 per cent in the case of the TV = 200 
stacked cluster (bottom right panel of Figure [TU]) . 

We finally note that in the case = 50, the mass 
estimate is biased low by 20 per cent. In this case, in 
fact, the number of galaxies within r2oo is only 27, on 
average, and the velocity field is too poorly sampled to 
return a correct mass. 

6. DISCUSSION 

Proper estimates of the mass of galaxy clusters and 
of the properties of their galaxy population depend on 
the accurate separation between the cluster members and 
those galaxies that appear projected in the cluster field 
of view but are not dynamically linked to the cluster. 
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Figure 9. Cumulative completeness fc (left panels) and contamination /; (right panels), where the 3D members are the bound galaxies 
and the 2D members arc the galaxies within the caustics, for the samples built from the stacked cluster with TV = 50 (upper panels) and 
A'^ = 200 (lower panels) particles within 3r200- The shaded areas and lines arc as in Figure [71 



Caustic 
location 



bound galaxies 

N = 50 Af = 200 

/c - O- fc fc + (T \ fi - (T fi /i + O- II /c - O- fc fc + (T \ fi - (J fi fi+CT 



r2oo 0.854 0.921 0.971 
2r2oo 0.812 0.898 0.959 
3r200 0.758 0.881 0.947 



0.000 0.026 0.059 
0.000 0.055 0.100 
0.022 0.076 0.129 



0.946 0.970 0.982 
0.945 0.967 0.981 
0.939 0.965 0.980 



0.021 0.034 0.052 
0.051 0.071 0.088 
0.080 0.106 0.128 



Table 2 

Cumulative completeness fc and contamination fi at r200i 2r2oo, and 3r2oo with their corresponding l-cr dispersion, with the bound 
particles as the 3D members and the particles within the caustics as the 2D members for the stacked cluster with = 50 (left) and 

N = 200 (right) particles within 3r200- 



Numerous methods to identify and remove interlop- 
ers in galaxy clusters have been suggested in the liter- 
ature. The algorithms are based either on the line-of- 
sight velocity separation of the galaxy from the cluster 
center alone or on both the velocity and the projected 
separations. The former class of algorithms is suited for 
galaxy samples that only survey the central regions of 
the clust ers. These alg orithms include the Str clipping 
method (|Yahil fc Vidall HOTTI) . which assumes that the 
velocity distribut ion is close to Gau ssian, the gap method 
(jZabludoff et al.lll990l: iBeers et al.lll990l) . and the adap- 



tive kernel method ()Pisaniiri993D . However, not all in- 
terlopers have large velocity separations from the clus- 
ter, as Figure H] illustrates. These interlopers are diffi- 
cult to identify and can generate a rather counterintu- 
itive systematic error: they can cause a slight underesti- 
mate, rath er than an overestimate, of the cluster velocity 
dispe rsion (|Cenlll997t iDiaferio etall 119991: iBiviano et~al] 
l2006l) . 

Rather than iterating over the velocity dispersion, like 
the 3cr clipping method does, one can iterate over the 
virial mass: at each step, one removes the galaxy that 
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Figure 10. Ratios Mcat/Mtruc between tiie cluster mass estimated with various methods and the true mass derived from the N-body 
simulation. The mass estimators are applied to individual clusters within r200i 2r2oo and 3r2oo (top panel) and to the stacked clusters with 
A'^ = 50 and A'^ = 200 (bottom panels). Mest is estimated with the virial theorem (squares), the median (diamonds) or the average (solid 
circles) mass estimators; the interlopers are removed with the caustic location (grey), or the binary tree (violet). The error bars show the 
68 per cent range of the distributions. The open circles show the mass estimated with the caustic technique on the full particle sample, 
because the caustic technique mass is not affected, in principle, by the presence of interlopers. 



causes the largest mass variation (iPerea et al.ll990l). The 
projected and virial mass estimators (jHeisler et al.lll985() 
are sensitive to the presence of interlopers in different 
ways; comparing th eir mass values can also be used to 
identify interlopers (jWoitak et al.|[2007l) . Iterating over 
the mass provides more robust results than iterating 
over the velocity dispersion, because interlopers can af- 
fect more the estimate of the size of the cluster, which 
enters the mass estimate, than the velocity dispersion 
(jPiaferio et al. 1999). 

When galaxy catalogs survey large cluster regions, 
the methods described above can be extended and ap- 
plied to galaxy subsamples separate d into bins of pro - 
jected distances to the cluster center (|Fadda et al.lll996l ). 
Thus, the velocity distribution assumed to be Gaussian 
at each radius can have different widths at different radii 
(jPrada et al.l 120031 ). or the velocity dispersion in the So- 
clipping method can be derived at different radii by solv- 
ing the Jeans equatio n for a steady sta te system and 
isotropic galaxy orbits (jLokas et al.ll2006l ). 

A step forward an interloper rejection algorithm based 
on a dynamical approach derives from the consideration 
that from an extensive galaxy sample we can actually ex- 
tract information on the depende nce of the escape veloc- 
ity on the clustrocentric distance (|den Hartog fc KatgertI 



|1996[) . However, most algorithms derive a mass profile, 
that yields the escape velocity as a function of the pro- 
jected distance from the cluster center, by assuming that 
the system is in dynamical equilibrium. The final solu- 
tion must thus be obtained by iteratively removing the 
identified interlopers until the mass profile converges. 

Here, we have shown the performance of the caustic 
technique, used as an interloper rejection algorithm. The 
technique only uses the number density distribution of 
galaxies in the redshift diagram to estimate the escape 
velocity profile from the system. The caustic technique 
does not assume dynamical equilibrium and is indepen- 
dent of the mass profile estimate. Therefore, any itera- 
tion is unnecessary. 

In the presence of extensive surveys, interloper rejec- 
tion algorithms based on the estimate of the mass or 
on the escape velocity usually perform better than al- 
gorithms solely based on the velocity dist r ibution (e.g. 
Woitak fc Lokai [20071: IWoitak et all 120071 iWhite et al.l 
' T (l200l ' 



20101) . IWoitak et al.l 



perform an extensive com- 
parisons of a number of different alg orithms. T hey con- 
clude that the method by [den Hart og fc Katger t (199^ 
is the most effective at removing interlopers, produc- 
ing samples with average contaminations fi in the range 
0.02 — 0.04 within one projected virial radius. 
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These values are in perfect agreement with our median 
fi — 0.02 (Table [1]). However, there are two noticeable 
differences between our analysis and theirs: the dynam- 
ical state of the clusters and the sample extension. The 
caustic technique is independent of the dynamical state 
of the cluster, and in fact, we only adopt the cluster mass 
as the criterion to build our sample of 100 simulated clus- 
ters. O n the contrary , in th eir sample of 10 simulated 
clusters. IWoitak et al.l ()2007[) pay particular attention to 
only include relaxed syste ms that have no sign o f ongo - 
ing mergers, because the Iden Hartog fc KatgertI ()1996l ) 
method requires dynamical equilibrium to be effective. 
Merging clusters requir e more sophisticated approaches 
(see, e.g., IWegneii 120111 and references therein), like the 
caustic technique. 

In addition, the dynamical equilibrium assumption 
clearly limits the analysis to projected radii smaller than 
the virial radius, where the equilibrium is expected to 
hold, whereas the caustic technique enables the identifi- 
cation of interlopers to much larger radii. We find that, 
with the caustic technique, the median contamination 
increases to fi = 0.047 and 0.080 at 2r2oo and 3r2oo, re- 
spectively, with a median completeness fc that remains 
larger than ^ 0.95. No other methods that remove in- 
terlopers in these regions are currently available. 

When we use the caustic technique to identify inter- 
lopers, the virial mass estimator returns a mass over- 
estimated bjM^_ge£_ceirtwithin r2ooj similarly to the 
results of iBiviano et al.l ()2006f ). who remov ed interlopers 
with a combination of the gap procedure (jGirardi et al.l 
I1993D and the method of den Hartog and K atgert 
(|Katgert et"alll2004t Iden Hartog fc KatgertI [19960 from 
mock clusters with more than 60 members. This bias is 
not present when we use the median and average mass 
estimators. At radii larger than ^200, where only the 
caustic technique can be used to remove interlopers, all 
mass estimates are unbiased. 

7. CONCLUSION 

We have tested the ability of the caustic technique to 
identify the cluster galaxy members using 100 galaxy 
clusters with mass M200 > IO^'^/i-^Mq extracted from 
a cosmological A^-body simulation of a ACDM universe. 
Our mock catalogues contain 1000 galaxies in the field 
of view of 12h~^ Mpc on a side at the cluster loca- 
tion. The true three-dimensional members, defined as 
the gravitationally bound galaxies, are compared to the 
galaxies identified as members with the caustic tech- 
nique. We find a completeness of fc — 0.95 ± 0.03 
within 3'r200j whereas the contamination increases from 

fi = 0.020t°;°t5 at r2oo to /, = O.OStVol at 3r2oo. The 
lack of spherical symmetry in clusters of galaxies causes 
most of the spread of the completeness and the contam- 
ination profiles. In fact, when applying the technique 
to samples built from a spherically symmetric stacked 
cluster the spreads decrease by at least a factor of two. 
No other technique for the identification of the members 
of a galaxy cluster provides such large completeness and 
small contamination at these large radii. 

The mass estimated with the virial theorem within 
3^200 after removing interlopers in the case of individual 
clusters is unbiased and within 30 per cent of the ac- 
tual mass. The use of the spherically symmetric stacked 
cluster decreases the spread to less than 10 per cent. 



The increa.sing a mount of data in clusters of galaxies 
(|Geller et al.l I2OIII) requires adequate tool for extract- 
ing the information they contain and properly comparing 
them with the output of the galax y formation mod elling 
that is increasingly sophisticated ( Saro et aI1l2012j ). 

The caustic technique can provide accurate estimates 
of the dark matter distribution in the outer regions of 
galaxy clusters and information on the dynamical con- 
nection between galaxies and clusters. The first piece of 
information is relevant for our understanding of the for- 
mation of cosmic structure and can even constrain the 
prope rties of dark matter (iSerra fc Dominguez Romerol 
I2OIID and the theory of gravitv (iLam et akboilT 

Determining the membership of galaxies in the out- 
skirts of clusters is unique to the caustic method. Ap- 
plying the algorithm to a large sample of clusters can 
provide the first accurate measure of how the gradients 
of properties of the cluster galaxy population, like color 
and star formation rate, merge into the field. In addition, 
it might provide the first determination of galaxy mem- 
bership in the filaments surrounding clus ters that repre- 
sent t he preferred pat h of m as s acc retion (" Pimbblet et al.l 
20041 ,Colbcrg et al ] [20051: [Aragon -Ca lvo et al l I20'T( 



Gonzalez fc Padillall2dlol )7 this piece of information can 



thus enlighten the connection between the formation of 
galaxies and the large-scale structure. In future work, we 
will investigate the potential impact of these measures. 
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